HUMAN TRANSCRIPTION FACTOR IIA 
Cross Reference to Related Applications 

[0001] This application is a divisional of and claims priority under 35 U.S.C. § 120 to 
U.S. Application No. 08/845,011, filed April 22, 1997, which is a divisional of and claims 
priority under 35 U.S.C. § 120 to U.S. Application No. 08/411,635, filed April 11, 1995 
(now U.S. Patent No. 5,652,117, issued on June 29, 1997), which was the National Stage 
of International Application No. PCT/US94/10644, filed September 20, 1994, each of 
which is hereby incorporated by reference herein in its entirety. 

Field of the Invention 

[0002] This invention relates to newly identified polynucleotides, polypeptides encoded 
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. More particularly, the polypeptide of 
the present invention is the small (y) subunit of human transcription factor IIA, sometimes 
hereinafter referred to as "small subunit". The invention also relates to inhibiting the action 
of such polypeptides. 

Background of the Invention 

[0003] In prokaryotes, simply mixing purified RNA polymerase, a template carrying a 
promoter, nucleoside triphosphates, and appropriate buffer and salts is sufficient to obtain 
specific gene transcription in vitro beginning at the correct sites. Purified RNA polymerase 
from eukaryotes, however, initiates transcription very poorly and essentially at random. 
Accordingly, accessory factors are required for accurate initiation of transcription in 
eukaryotes. Some of these transcription factors are general factors required for initiation at 
all promoters, while others are gene-specific and are required only for certain promoters. 
Among the general factors is a protein called Transcription Factor HD "TFHD", which binds 
to a TATA sequence, wherein T represents thymidine and A represents adenosine, in 
promoters. Other general factors are also involved in the assembly of a multicomponent 
protein complex at the promoter. 

[0004] In general, ti-anscription factors are found to contain two functional domains, one 
for DNA-binding and one for transcriptional activation. These functions often reside within 
circumscribed structural domains that retain their function when removed from their natural 
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context. The DNA-binding domains of transcription factors fall into several structural 
families based on their primary amino acid sequence. 

[0005] In order to identify the specific nucleotides that control gene expression, regions 
of the gene flanking the coding region can be sequenced. Comparisons of these sequences 
reveal common patterns near the 5' and 3' ends of different genes. These are predicted to be 
important for proper transcription by RNA polymerase. The most common motif is the 
TATA sequence around 30 bp from the transcriptional start site. Other conserved sequences 
have been found roughly 50 to 100 bp upstream of the transcriptional start site. 
[0006] Eukaryotic transcriptional activation requires the characterization of several 
multiprotein complexes, referred as general transcription factors and coactivators 1 ' 2 . The 
heteromeric general transcription factor TFUA binds directly to the TATA binding protein 
(TBP) 3 ' 4 and has been implicated in the process of transcriptional activation 5 " 8 . The y subunit 
of TFUA binds weakly to the TATA binding protein, but strongly stabilized the binding of 
the large subunit of TFUA (a (3) to TBP. Recombinant human TFUA is functional for the 
transcriptional activation mediated by at least three distinct activators. Both the a p and y 
subunits are essential for activator dependent stimulation of TFUD by binding to promoter 
DNA, thus facilitating the first step in pre-initiation complex formation. This demonstrates 
that TFUA is an evolutionary conserved general transcription factor important for activator 
regulated transcription. 

[0007] The interaction of TFUA with the general transcription factor UD (TFUD) has 
been shown to be rate-limiting step in the transcriptional activation process 5 . TFUA binds 
directly to TBP 3 ' 4 , the DNA binding subunit of the multiprotein TFUD complex 12 . TBP 
associated factors (TAFs) 12 " 14 , which are essential for activated transcription, are also 
required for an activator-dependent stimulation of the TFIIA-TFIID-promoter complex 6 . 
While TFUA has no known function in unregulated basal transcription 15 , it has been 
postulated that TFUA plays a role in preventing inhibitors of TFUD from repressing 
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transcription 

[0008] A TFUA homolog has been identified in yeast 9 " 10 , and the genes encoding the two 
subunits are essential for viability 11 . Human TFUA consists of three polypeptides (a, P, y), 
but the two largest subunits are derived from a single gene which shares homology to the 
large subunit of yeast TFUA 7 ' 20,21 . Both the human and yeast protein bind to the evolutionary 



conserved domain of TBP 22 and stimulate transcription reconstituted with TFHD, but not 
TBP 17 ' 19 . 

Summary of the Invention 
[0009] The polypeptide of the present invention has been putatively identified as the y 
subunit of TFHA. This identification has been made as a result of amino acid sequence 
homology. 

[0010] In accordance with one aspect of the present invention, there is provided a novel 
mature polypeptide which is the small subunit of TFTiA, as well as fragments, analogs and 
derivatives thereof The polypeptide of the present invention is of human origin. 
[0011] In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptide. 

[0012] In accordance with yet a further aspect of the present invention, there is provided 
a process for producing such polypeptide by recombinant techniques. 

[0013] In accordance with yet a further aspect of the present invention, there is provided 
a process for utilizing such polypeptide, or polynucleotide encoding such polypeptide for 
therapeutic purposes, for example, for regulating gene-specific or global transcription and to 
counteract repressors of the TFHD complex. 

[0014] In accordance with yet a further aspect of the present invention, there are 
provided antibodies against such polypeptides. 

[0015] In accordance with yet another aspect of the present invention, there are provided 
antagonists to such polypeptides, which may be used to inhibit the action of such 
polypeptides, for example, to inhibit transcription of undesired cells, e.g., malignancies. 
[0016] These and other aspects of the present invention should be apparent to those 
skilled in the art from the teachings herein. 

Brief Description of the Drawings 
[0017] The following drawings are illustrative of embodiments of the invention and are 
not meant to limit the scope of the invention as encompassed by the claims. 
[0018] Figure 1A depicts the cDNA sequence (SEQ ID NOT) and corresponding 
deduced amino acid sequence (SEQ ID NO:2) of the small subunit of human TFHA. The 
small subunit polypeptide shown is the mature form of the polypeptide. Standard one-letter 
abbreviations for amino acids are used. 
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[0019] Figure IB illustrates a comparison of the amino acid composition of the human 
TFUA small subunit (SEQ ID NO:2) and the yeast (TOA2) TFHA small subunit (SEQ ID 
NO:5.) 

[0020] Figure 1C is a schematic diagram depicting the human TFUA subunits. TFUA is 
encoded by two genes, ap and y. The ap protein is processed post-translationally into two 
polypeptides (a and (3), approximately 35 and 19 kDa, respectively. Recombinant a and p 
polypeptides were designed with a breakpoint at amino acid residue 251, but this may not 
exactly correspond to the naturally occurring proteolytic cleavage site. 
[0021] Figures 2A-B illustrate the functional activity of recombinant human, yeast and 
heterologous TFUA. Figure 2A illustrates the formation of a D-A complex (resulting from 
the addition of TFUA small subunit to TBP) bound to DNA detected by polyacrylamide gel 
EMS A (electrophoresis mobility shift assay). A 29 bp oligonucleotide probe containing the 
adenovirus E1B TATA element was incubated with various preparations of TFUA in the 
absence (-) or presence (+) of 10 ng of yeast TBP as indicated above each lane. 
Approximately 50 ng of recombinant TFUA was incubated in each reaction. Partially 
purified human TFUA (hUA, lanes 3,4), combinations of recombinant human subunits (ap, 
a, p, y, lanes 6-16), recombinant yeast c*p with yeast y (hap/yy,lanes 19,20) are indicated 
above each lane. 

[0022] Figure 2B illustrates the requirement of TFUA activity in reconstitution of 
transcriptional activation by the Epstein-Barr virus encoded activator, Zta transcriptional 
activator. Transcription reactions were reconstituted with immunoaffinity purified TFUD, 
recombinant TFUB, partially purified RNA polymerase U, TFUE, TFUF, and USA with (+) 
or without (-) Zta. Various TFUA preparations were added to reactions as indicated above 
each lane. Arrow at the left indicates the correctly initiated transcript. 

[0023] Figure 3A illustrates the interaction of TFUA subunits with TBP. 35 S-labelled 
TFUA y (lanes 1-3), ap (lanes 4-6), ap+y (lanes 7-9) or T3 luciferase control (lanes 10-12) 
proteins were incubated with GST (lanes 2, 5, 8, 11,) or GST-TBP (lanes 3, 6, 9, 12) 
immobilized on glutathione sepharose beads, as indicated above each lane. Lanes marked 
input (lanes 1, 4, 7, 10) represent approximately 2.5% of the reaction input. 
[0024] Figure 3B illustrates that the interaction of TFUA subunits reveals a strong 
homotypic binding of TFUA y. 35 S-labelled TFUA y (lanes 1-4), ap (lanes 5-8), TBP (lanes 
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9-12) or T3 luciferase (lanes 13-16) were incubated with GST (lanes 2, 6, 10, 14), GST-aP 
(lanes 3, 7, 1 1, 15), or GST-y (lanes 4, 8, 12, 16) fixed to glutathione sepharose beads. 
[0025] Figure 4A illustrates that recombinant TFHA restores the ability of three distinct 
activation domains to function in TFIIA depleted nuclear extracts. The transcriptional 
activator proteins Zta (lanes 2, 4, 6, 7, 8, 9), GAL4-AH (lanes 10, 12, 14) or VP16 (lanes 16, 
18, 20) were incubated with untreated HeLa cell nuclear extract (lanes 1, 2, 9, 10, 15, 16) or 
with TFEA-depleted HeLa cell nuclear extracts (lanes 3-9, 11-14, 17-20) in in vitro 
transcription reactions. 50 ng of recombinant TFIIA ap^ (lanes 5, 6, 13, 14, 19, 20), TFIIA 
ap (lane 7), or TFIIA y (lane 8) was supplemented to depleted extracts. Correctly initiated 
primer extension products for Zta and GAL4 templates are indicated by the arrows at the left 
O and right, respectively. 

S 1 ! [0026] Figure 4B illustrates the requirement for TFIIA in the reconstitution of 

y transcriptional activation by an acidic activator with partially purified general transcription 

In factors. Transcription reactions were essentially the same as those described for Fig. 2B, 

U except that the GAL- AH activator and the G 5 E 1 BTC AT template were used. 

[0027] Figure 4C shows that recombinant TFIIA promotes an activator and TAF 
"jj dependent TFTTD promoter complex. Mg agarose gel EMS A of DNA binding reactions with 

"I immunopurified TFEDD (lanes 3-12), Zta (even lanes, and 13), and a 250 bp probe derived 

H- from the Z7E4TCAT promoter. Zta (20 ng), recombinant TFIIA (50 ng), and 0.1 

footprinting unit of TFHD were incubated with approximately 1 fmole of radiolabeled 

promoter DNA for 15 minutes at room temperature. 

Detailed Description of the Invention 

[0028] Sequencing inaccuracies are a common problem when attempting to determine 
polynucleotide sequences. Accordingly, the sequence of Figure 1A (SEQ TD NOS:l and 2)is 
based on several sequencing runs and the sequencing accuracy is considered to be at least 
97%. 

[0029] In accordance with an aspect of the present invention, there is provided an 
isolated nucleic acid (polynucleotide) (SEQ ID NO:l) which encodes for the mature 
polypeptide having the deduced amino acid sequence of Figure 1 A (SEQ ID NO:2) or for the 
mature polypeptide encoded by the cDNA of the clone deposited with the American Type 
Culture Collection (ATCC), 10801 University Boulevard. Manassas, Virginia 20110-2209 
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(present address), as ATCC Deposit No. 75809 on June 10, 1994. Since the deposit is 
being maintained under the terms of the Budapest Treaty, it will be made available to a 
patent office signatory to the Budapest Treaty. 

[0030] The polynucleotide of this invention (SEQ ID NOT) was discovered in a T-cell 
library. It is structurally related to the small subunit of yeast (TOA2) (SEQ ID NO:5). It 
contains an open reading frame encoding a protein of 109 amino acid residues (SEQ ID 
NO:2). The protein exhibits the highest degree of homology to yeast TOA2 with 40% 
identity and 50% similarity over the entire amino acid stretch. 

[0031] The polynucleotide of the present invention (SEQ ID NOT) may be in the form 
of RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and synthetic 
DNA. The DNA may be double-stranded or single-stranded, and if single stranded may be 
the coding strand or non-coding (anti-sense) strand. The coding sequence which encodes the 
mature polypeptide may be identical to the coding sequence shown in Figure 1A (SEQ ID 
NOT) or that of the deposited clone or may be a different coding sequence which coding 
sequence, as a result of the redundancy or degeneracy of the genetic code, encodes the same 
mature polypeptide as the DNA of Figure 1 A (SEQ ID NOT) or the deposited cDNA. 
[0032] The polynucleotide which encodes for the mature polypeptide of Figure 1A (SEQ 
ID NO:2) or for the mature polypeptide encoded by the deposited cDNA includes only the 
coding sequence for the mature polypeptide since the polypeptide is a nuclear protein which 
is not excreted to the outside of the cell. 

[0033] Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
polynucleotide which includes additional coding and/or non-coding sequence. 
[0034] The present invention further relates to variants of the hereinabove described 
polynucleotides which encode for fragments, analogs and derivatives of the polypeptide 
having the deduced amino acid sequence of Figure 1 A (SEQ ID NO:2) or the polypeptide 
encoded by the cDNA of the deposited clone. The variant of the polynucleotide may be a 
naturally occurring allelic variant of the polynucleotide or a non-naturally occurring variant 
of the polynucleotide. 

[0035] Thus, the present invention includes polynucleotides encoding the same mature 
polypeptide as shown in Figure 1 A (SEQ JD NO:2) or the same mature polypeptide encoded 
by the cDNA of the deposited clone as well as variants of such polynucleotides which 



variants encode for a fragment, derivative or analog of the polypeptide of Figure 1 A (SEQ ID 
NO:2) or the polypeptide encoded by the cDNA of the deposited clone. Such nucleotide 
variants include deletion variants, substitution variants and addition or insertion variants. 
[0036] As hereinabove indicated, the polynucleotide may have a coding sequence which 
is a naturally occurring allelic variant of the coding sequence shown in Figure 1A (SEQ ID 
NO:l) or of the coding sequence of the deposited clone. As known in the art, an allelic 
variant is an alternate form of a polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which does not substantially alter the 
function of the encoded polypeptide. 

[0037] The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence which allows for purification of the polypeptide of the 
present invention. The marker sequence may be a hexa-histidine tag supplied by a pQE-9 
vector to provide for purification of the mature polypeptide fused to the marker in the case of 
a bacterial host, or, for example, the marker sequence may be a hemagglutinin (HA) tag 
when a mammalian host, e.g. COS-7 cells, is used. The HA tag corresponds to an epitope 
derived from the influenza hemagglutinin protein (Wilson, I, et al., Cell, 37:767 (1984)). 
[0038] The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 50% and preferably 70% identity 
between the sequences. The present invention particularly relates to polynucleotides which 
hybridize under stringent conditions to the hereinabove-described polynucleotides . As 
herein used, the term "stringent conditions" means hybridization will occur only if there is at 
least 95% and preferably at least 97% identity between the sequences. The polynucleotides 
which hybridize to the hereinabove described polynucleotides in a preferred embodiment 
encode polypeptides which retain substantially the same biological function or activity as the 
mature polypeptide encoded by the cDNA of Figure 1 A (SEQ ID NO:2) or the deposited 
cDNA. 

[0039] The deposit(s) referred to herein will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the Deposit of Micro-organisms for 
purposes of Patent Procedure. These deposits are provided merely as convenience to those 
of skill in the art and are not an admission that a deposit is required under 35 U.S.C. §112. 
The sequence of the polynucleotides contained in the deposited materials, as well as the 
amino acid sequence of the polypeptides encoded thereby, are incorporated herein by 



reference and are controlling in the event of any conflict with any description of sequences 
herein. A license may be required to make, use or sell the deposited materials, and no such 
license is hereby granted. 

[0040] The present invention further relates to the small subunit of TFIIA polypeptide 
which has the deduced amino acid sequence of Figure 1A (SEQ ID NO:2) or which has the 
amino acid sequence encoded by the deposited cDNA, as well as fragments, analogs and 
derivatives of such polypeptide. 

[0041] The terms "fragment," "derivative" and "analog" when referring to the 
polypeptide of Figure 1A (SEQ ID NO:2) or that encoded by the deposited cDNA, means a 
polypeptide which retains essentially the same biological function or activity as such 
polypeptide. Thus, an analog includes a proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

[0042] The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 
[0043] The fragment, derivative or analog of the polypeptide of Figure 1A (SEQ ID 
NO:2) or that encoded by the deposited cDNA may be (i) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue 
(preferably a conserved amino acid residue) and such substituted amino acid residue may or 
may not be one encoded by the genetic code, or (ii) one in which one or more of the amino 
acid residues includes a substituent group, or (iii) one in which the mature polypeptide is 
fused with another compound, such as a compound to increase the half-life of the 
polypeptide (for example, polyethylene glycol), or (iv) one in which the additional amino 
acids are fused to the mature polypeptide, which are employed for purification of the mature 
polypeptide. Such fragments, derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

[0044] The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 
[0045] The term "isolated" means that the material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 



and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 
[0046] The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 
[0047] Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a viral particle, 
a phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
TFUA genes. The culture conditions, such as temperature, pH and the like, are those 
previously used with the host cell selected for expression, and will be apparent to the 
ordinarily skilled artisan. 

[0048] The polynucleotides of the present invention may be employed for producing 
polypeptides by recombinant techniques. Thus, for example, the polynucleotide may be 
included in any one of a variety of expression vectors for expressing a polypeptide. Such 
vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector may be used as 
long as it is replicable and viable in the host. 

[0049] The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures, hi general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such procedures and others are deemed 
to be within the scope of those skilled in the art. 

[0050] The DNA sequence in the expression vector is operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As 
representative examples of such promoters, there may be mentioned: LTR or SV40 
promoter, the E. coli lac or trp, the phage lambda Pl promoter and other promoters known to 
control expression of genes in prokaryotic or eukaryotic cells or their viruses. The 
expression vector also contains a ribosome binding site for translation initiation and a 



transcription terminator. The vector may also include appropriate sequences for amplifying 
expression. 

[0051] In addition, the expression vectors preferably contain one or more selectable 
marker genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

[0052] The vector containing the appropriate DNA sequence as hereinabove described, 
as well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 

[0053] As representative examples of appropriate hosts, there may be mentioned: 
bacterial cells, such as K coli, St re ptomyc es, Salmonella typhirrmriiim; fungal cells, such as 
yeast; insect cells such as Drnsopbila and SJ2; animal cells such as CHO, COS or Bowes 
melanoma; plant cells, etc. The selection of an appropriate host is deemed to be within the 
scope of those skilled in the art from the teachings herein. 

[0054] More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters 
are known to those of skill in the art, and are commercially available. The following vectors 
are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, 
phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A 
(Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: 
pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). However, any other plasmid or vector may be used as long as they are 
replicable and viable in the host. 

[0055] Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoters include 
lad, lacZ, T3, T7, gpt, lambda Pr, Pl and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
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mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

[0056] In a further embodiment, the present invention relates to host cells containing the 
above-described constructs. The host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be 
effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, or 
electroporation. (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular Biology, 
(1986)). 

[0057] The constructs in host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. Alternatively, the polypeptides of the 
invention can be synthetically produced by conventional peptide synthesizers. 
[0058] Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other 
cells under the control of appropriate promoters. Cell-free translation systems can also be 
employed to produce such proteins using RNAs derived from the DNA constructs of the 
present invention. Appropriate cloning and expression vectors for use with prokaryotic and 
eukaryotic hosts are described by Sambrook, et al, Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), the disclosure of which is 
hereby incorporated by reference. 

[0059] Transcription of the DNA encoding the polypeptides of the present invention by 
higher eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers 
are cis-acting elements of DNA, usually about from 10 to 300 bp that act on a promoter to 
increase its transcription. Examples including the SV40 enhancer on the late side of the 
replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. 
[0060] Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. noli and S rerevisiae TRP1 gene, and a promoter derived from a highly-expressed 
gene to direct transcription of a downstream structural sequence. Such promoters can be 
derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase 
(PGK), a-factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and 
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termination sequences, and preferably, a leader sequence capable of directing secretion of 
translated protein into the periplasmic space or extracellular medium. Optionally, the 
heterologous sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified purification of 
expressed recombinant product. 

[0061] Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation 
initiation and termination signals in operable reading phase with a functional promoter. The 
vector will comprise one or more phenotypic selectable markers and an origin of replication 
to ensure maintenance of the vector and to, if desirable, provide amplification within the 
host. Suitable prokaryotic hosts for transformation include E. coli , Bacillus subtihs , 
Salmonella typhimnrinm and various species within the genera Pseudomonas, Streptomyces, 
and Staphylococcus, although others may also be employed as a matter of choice. 
[0062] As a representative but nonlimiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, 
Madison, WI, USA). These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

[0063] Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
[0064] Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

[0065] Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
of cell lysing agents, such methods are well know to those skilled in the art. 
[0066] Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 
of monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 (1981), and other cell 
lines capable of expressing a compatible vector, for example, the CI 27, 3T3, CHO, HeLa 
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and BHK cell lines. Mammalian expression vectors will comprise an origin of replication, a 
suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, 
and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 splice, 
and polyadenylation sites may be used to provide the required nontranscribed genetic 
elements. 

[0067] The TFHA small subunit polypeptides can be recovered and purified from 
recombinant cell cultures by methods including ammonium sulfate or ethanol precipitation, 
acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography 
hydroxylapatite chromatography and lectin chromatography. Protein refolding steps can be 
used, as necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification steps. 
[0068] The polypeptides of the present invention may be a naturally purified product, or 
a product of chemical synthetic procedures, or produced by recombinant techniques from a 
prokaryotic or eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be glycosylated or may 
be non-glycosylated. Polypeptides of the invention may also include an initial methionine 
amino acid residue. 

[0069] While Applicant does not wish to limit the scientific reasoning in the present 
invention to any particular theory, the following procedures are illustrations of the functional 
aspects of TFIIA as a whole and the small (y) subunit, in particular. 

[0070] TFIIA activity was first assayed for the stabilization of TBP binding to a TATA 
box containing oligonucleotide probe in polyacrylamide gel EMS A (Fig. 2A). In the absence 
of TFHA, yeast TBP does not form a stable complex with a TATA box containing 
oligonucleotide (lane 2). Addition of partially purified human TFIIA (hUA) to TBP resulted 
in the formation of a stable complex (D-A) (lane 4). Recombinant a(3+y also produced a 
stable D-A complex (lane 6). TBP failed to form the stable D-A complex when the ap or y 
subunit was added individually (lanes 8 and 10). Since natural TFIIA occurs as three 
polypeptides, the a|3 protein was expressed as two independent subunits (Fig. 1C). The a-Hy 
had no effect (lane 12), while (3+7 had a small effect (lane 14) on TBP binding. In contrast, 
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the combination of a, p and y submits (ap4j) resulted in strong stimulation of TBP binding, 
while electrophoretic mobility was very similar to the native human TFIIA protein (lane 16). 
In addition, the yeast y subunit could also form a stable D-A complex when mixed with the 
human ap subunit, demonstrating the interaction between the ap and y subunits is 
evolutionarily conserved (lane 20), The electrophoretic mobility of the D-A complex was 
influenced by the different apy forms of TFIIA, suggesting that TFIIA is retained in the 
bound complexes. 

[0071] The various TFIIA complexes were analyzed for their ability to support 
transcriptional activation in reactions reconstituted with partially purified general 
transcription factors, the coactivator USA 25 , and the Epstein-Barr virus encoded activator, 
Zta 6 . The y subunit was essential for transcriptional activation, and the a and p subunit could 
be supplemented as either a single ap polypeptide, or as two distinct polypeptides (a+p) (Fig. 
2B). In all cases, the formation of the D-A complex in EMS A correlated with the ability of 
TFIIA to support transcriptional activation by Zta. 

[0072] Radio labelled ap, y or ap+y proteins were tested for their ability to interact with 
glutathione-S-transferase (GST) or GST-TBP fusion proteins immobilized on glutathione 
agarose (Fig. 3 A). The y subunit bound weakly, but specifically to GST-TBP (lane 3), as did 
the ap subunit (lane 6). Significantly, the combination of the ap+y subunits markedly 
increased the binding to GST-TBP (lane 9). The ability of radiolabeled TBP to bind to 
GST-y and GST-aP was also examined (Fig. 3B). TBP bound to GST-ap protein, but failed 
to interact with the GST-y protein (Fig. 3B, lanes 1 1 and 12). The discrepancy of the binding 
of y to TBP may be a partial result of the steric hindrance of GST fused to the amino 
terminus of y. Although both aP and y are capable of making direct contact with TBP, the 
heterodimer clearly binds with higher affinity. 

[0073] The interaction of y with the ap polypeptides was also examined by the GST- 
fusion protein binding assay. Radiolabeled y, ap, or T3 control were incubated with GST-y, 
GST-ap, or GST alone (Fig. 3B). As expected, the aP subunit bound to GST-y and the y 
subunit bound to GST-ap. The y subunit also bound strongly to GST-y, while ap did not 
bind GST-aP, suggesting that a homotypic association of the y subunit contributes to the 
oligomerization state of TFIIA. The T3 control protein did not interact with any of the GST 
proteins tested. 
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[0074] To determine whether TFIIA was required for transcriptional activation by 
activators distinct from Zta, the need for TFIIA by the acidic activator GAL4-AH 26 and the 
herpes virus derived activator GAL4-VP16 27 was examined. Zta is not an acidic activator, 
like AH and shares no obvious homology to the VP16 activation modules. TFIIA depleted 
HeLa cell nuclear extracts were prepared by serial passage over nickel agarose, which binds 
specifically to the ap subunit of TFIIA. 7 ' 20 Addition of Zta to the depleted extract failed to 
produce significant transcription levels (Fig. 4A, lane 4). Addition of TFIIA ap + y subunits 
restored activation of the depleted extracts to levels observed in the undepleted extract 
(compare lanes 2 and 6). The ap or y subunit alone failed to restore Zta activation in these 
depleted extracts indicating that both subunits were equally depleted by the nickel agarose. 
Similarly, GAL4-AH (lanes 9-14) and GAL4-VP16 (lanes 15-20) did not function in the 
TFIIA depleted extracts. Addition of recombinant aP+y subunits restored activator 
dependent transcription for all three distinct activators. GAL4-AH was also shown to require 
TFIIA in reconstituted transcription assays (Fig. 4B) as did Zta (Fig. 2B). These results 
indicate that distinct activation domains require TFIIA for activated transcription in both 
crude nuclear extracts, as well as in more purified reconstitution systems. 
[0075] Partially purified TFIIA is required for an activation domain stimulation of TFIID 
binding to promoter DNA. 6 Using Mg agarose gel EMS A, it was found that recombinant 
TFIIA substitutes for the partially purified TFIIA in this function (Fig. 4C). In the absence of 
TFIIA, Zta does not stimulate the formation of a TFITD-DNA complex (Fig. 4C, lane 4). 
However, the addition of partially purified TFIIA (lane 6) or afi+y (lane 12), but not ap or y 
alone (lanes 7-10) allows Zta stimulation of TFIID binding to the promoter DNA. These 
results demonstrate that TFIIA mediates an interaction between activators and TFIID, which 
result in the increase affinity of TFHD for promoter DNA. 

[0076] The isolation of the y subunit of human TFIIA has allowed testing for the 
requirement for highly purified TFHA in the reconstitution of activated transcription in vitro. 
Several distinct activation domains require TFIIA for their ability to function. The 
recombinant human y subunit has also been shown to be functionally interchangeable in in 
vitro transcription and TBP binding assays. 

[0077] The TFIIA small subunit polypeptide may be used to prevent inhibitors of TFHD 
from repressing transcription, this is useful where a particular gene product is desired and is 
not being produced at the desired levels due to the inhibition of the TFHD complex. 
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[0078] Most importantly, the TFIIA small subunit polypeptide may be used to regulate 
transcription globally or in a gene-specific manner, to obtain desired concentrations of 
particular proteins. For example, in the case of a malignancy, the TFIIA small subunit may 
be repressed to prevent transcription and in the case where a protein is desired, for example 
growth hormone, the TFIIA small subunit may enhance transcription and the production of 
the gene product. 

[0079] The polypeptide of the present invention is also useful for identifying other 
molecules which have similar biological activity. An example of a screen for this is isolating 
the coding region of the small subunit of the TFIIA gene by using the known DNA sequence 
to synthesize an oligonucleotide probe. Labeled oligonucleotides having a sequence 
complementary to that of the gene of the present invention are used to screen a library of 
human cDNA, genomic DNA or mRNA to determine which members of the library the 
probe hybridizes to. 

[0080] This invention provides a method of screening compounds to identify those 
which enhance (agonists) or block (antagonists) interaction of the small subunit of TFHA 
with TFiID. An agonist is a compound which increases the natural biological function of the 
small subunit of TFIIA, while antagonists eliminate such functions. As an example, purified 
RNA polymerase, a template carrying a promoter, nucleoside triphosphates and appropriate 
buffer and salts may be mixed with TFIIA and TFITD in the presence of the compound under 
conditions where transcription would normally take place. The ability of the compound to 
enhance or block the binding of TFIIA to the template DNA could then be determined by 
measuring the level of transcription product. 

[0081] Alternatively, the assay may be a cell-based assay wherein a TFHA-inducible 
promoter drives the expression of a marker gene. TFIIA and the compound to be screened 
would then be added to measure the level of production of the marker gene. Additionally, 
this cell-based assay could be used in tandem with the binding assay to determine if the 
effects on transcription are specific to TFIIA agonism or antagonism. 

[0082] Potential antagonists include an antibody, or in some cases, an oligonucleotide, 
which binds to the small subunit of TFIIA. Alternatively, a potential antagonist may be a 
closely related protein which binds to the TBP protein of the TFHD complex but do not 
initiate transcription. An example of such a closely related protein is a negative dominant 
mutant, wherein one the two subunits of TFIIA are mutated and do not retain function. The 
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negative dominant mutant, however, still recognizes substrate but does not initiate 
transcription. 

[0083] Another potential antagonist is an antisense construct prepared using antisense 
technology. Antisense technology can be used to control gene expression through triple- 
helix formation or antisense DNA or RNA, both of which methods are based on binding of a 
polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes for the mature polypeptides of the present invention, is used to 
design an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A 
DNA oligonucleotide is designed to be complementary to a region of the gene involved in 
transcription (triple helix -see Lee et al., Nucl. Acids Res., 6:3073 (1979); Cooney et al, 
Science, 241:456 (1988); and Dervan et al, Science, 251: 1360 (1991)), thereby preventing 
transcription and the production of the TFDA small subunit. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into the TFIIA small subunit (antisense - Okano, J. Neurochem., 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, 
FL (1988)). The oligonucleotides described above can also be delivered to cells such that the 
antisense RNA or DNA may be expressed in vivo to inhibit production of the small subunit 
of TFIIA. 

[0084] Potential antagonists include a small molecule which binds to and occupies the 
active site of the polypeptide such that TFIIA is unable to activate TFHD and initiate 
transcription. Examples of small molecules include but are not limited to small peptides or 
peptide-like molecules. 

[0085] The antagonists may be employed to inhibit the transcription of undesired 
polypeptides. For example, where a particular polypeptide leads to an undesired condition 
the antagonists mentioned above may be used to prevent transcription of that polypeptide. 
An example of this is the transcription and differentiation of cancerous cells. The 
antagonists may be employed in a composition with a pharmaceutically acceptable carrier, 
e.g., as hereinafter described. 

[0086] The polypeptides of the present invention may be employed in combination with 
a suitable pharmaceutical carrier. Such compositions comprise a therapeutically effective 
amount of the polypeptide, and a pharmaceutically acceptable carrier or excipient. Such a 
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carrier includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, 
and combinations thereof. The formulation should suit the mode of administration. 
[0087] The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions of 
the invention. Associated with such container(s) can be a notice in the form prescribed by a 
governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of manufacture, use or sale for human 
administration. In addition, the polypeptides of the present invention may be employed in 
conjunction with other therapeutic compounds. 

[00881 The pharmaceutical compositions may be administered in a convenient manner 
such as by the topical, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or 
intradermal routes. The pharmaceutical compositions are administered in an amount which 
is effective for treating and/or prophylaxis of the specific indication. In general, they are 
administered in an amount of at least about 10 Lig/kg body weight and in most cases they will 
be administered in an amount not in excess of about 8 mg/Kg body weight per day. In most 
cases, the dosage is from about 10 ^ig/kg to about 1 mg/kg body weight daily, taking into 
account the routes of administration, symptoms, etc. 

[0089] The small subunit polypeptide of TFIIA and agonists and antagonists which are 
polypeptides may also be employed in accordance with the present invention by expression 
of such polypeptides in vivo, which is often referred to as "gene therapy." 
[0090] Thus, for example, cells from a patient may be engineered with a polynucleotide 
(DNA or RNA) encoding a polypeptide ex vivo, with the engineered cells then being 
provided to a patient to be treated with the polypeptide. Such methods are well-known in the 
art. For example, cells may be engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding a polypeptide of the present invention. 
[0091] Similarly, cells may be engineered in vivo for expression of a polypeptide in vivo 
by, for example, procedures known in the art. As known in the art, a producer cell for 
producing a retroviral particle containing RNA encoding the polypeptide of the present 
invention may be administered to a patient for engineering cells in vivo and expression of the 
polypeptide in vivo. These and other methods for administering a polypeptide of the present 
invention by such method should be apparent to those skilled in the art from the teachings of 
the present invention. For example, the expression vehicle for engineering cells may be 
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other than a retrovirus, for example, an adenovirus which may be used to engineer cells in 
vivo after combination with a suitable delivery vehicle. 

[0092] Fragments of the full length TFHA small subunit gene may be used as a 
hybridization probe for a cDNA library to isolate the full length TFHA small subunit gene 
and to isolate other genes which have a high sequence similarity to the gene. Probes of this 
type can be, for example, 30, 40, 50 75, 90, 100 or 150 bases. Preferably, however, the 
probes have between 30 and 50 base pairs. The probe may also be used to identify a cDNA 
clone corresponding to a full length transcript and a genomic clone or clones that contain the 
complete gene including regulatory and promoter regions, exons, and introns. The probe 
may be labelled, for example, by radioactivity to facilitate identification of hybridization. 
[0093] The sequences of the present invention are also valuable for chromosome 
identification. The sequence is specifically targeted to and can hybridize with a particular 
location on an individual human chromosome. Moreover, there is a current need for 
identifying particular sites on the chromosome. Few chromosome marking reagents based 
on actual sequence data (repeat polymorphisms) are presently available for marking 
chromosomal location. The mapping of DNAs to chromosomes according to the present 
invention is an important first step in correlating those sequences with genes associated with 
disease. 

[0094] Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the cDNA. Computer analysis of the cDNA is used to rapidly 
select primers that do not span more than one exon in the genomic DNA, thus complicating 
the amplification process. These primers are then used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids containing the 
human gene corresponding to the primer will yield an amplified fragment. 
[0095] PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular DNA to a particular chromosome. Using the present invention with the same 
oligonucleotide primers, sublocalization can be achieved with panels of fragments from 
specific chromosomes or pools of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its chromosome include in situ 
hybridization, prescreening with labeled flow-sorted chromosomes and preselection by 
hybridization to construct chromosome specific-cDNA libraries. 
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[0096] Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase 
chromosomal spread can be used to provide a precise chromosomal location in one step. 
This technique can be used with cDNA as short as 500 or 600 bases; however, clones larger 
than 2,000 bp have a higher likelihood of binding to a unique chromosomal location with 
sufficient signal intensity for simple detection. FISH requires use of the clones from which 
the EST was derived, and the longer the better. For example, 2,000 bp is good, 4,000 is 
better, and more than 4,000 is probably not necessary to get good results a reasonable 
percentage of the time. For a review of this technique, see Verma et al, Human 
Chromosomes: a Manual of Basic Techniques, Pergamon Press, New York (1988). 
[0097] Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man 
(available on line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and diseases that have been mapped to the same chromosomal 
region are then identified through linkage analysis (coinheritance of physically adjacent 
genes). 

[0098] Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed in some or 
all of the affected individuals but not in any normal individuals, then the mutation is likely to 
be the causative agent of the disease. 

[0099] With current resolution of physical mapping and genetic mapping techniques, a 
cDNA precisely localized to a chromosomal region associated with the disease could be one 
of between 50 and 500 potential causative genes. (This assumes 1 megabase mapping 
resolution and one gene per 20 kb). 

[0100] The polypeptides, their fragments or other derivatives, or analogs thereof, or cells 
expressing them can be used as an immunogen to produce antibodies thereto. These 
antibodies can be, for example, polyclonal or monoclonal antibodies. The present invention 
also includes chimeric, single chain, and humanized antibodies, as well as Fab fragments, or 
the product of an Fab expression library. Various procedures known in the art may be used 
for the production of such antibodies and fragments. 

[0101] Antibodies generated against the polypeptides corresponding to a sequence of the 
present invention can be obtained by direct injection of the polypeptides into an animal or by 
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administering the polypeptides to an animal, preferably a nonhuman. The antibody so 
obtained will then bind the polypeptides itself. In this manner, even a sequence encoding 
only a fragment of the polypeptides can be used to generate antibodies binding the whole 
native polypeptides. Such antibodies can then be used to isolate the polypeptide from tissue 
expressing that polypeptide. 

[0102] For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 
hybridoma technique (Kohler and Milstein, 1975, Nature, 256:495-497), the trioma 
technique, the human B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 
4:72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et 
al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
[0103] Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
products of this invention. 

[0104] The present invention will be further described with reference to the following 
examples; however, it is to be understood that the present invention is not limited to such 
examples. All parts or amounts, unless otherwise specified, are by weight. 
[0105] In order to facilitate understanding of the following examples certain frequently 
occurring methods and/or terms will be described. 

[0106] "Plasmids" are designated by a lower case p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures, hi addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

[0107] "Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme in 
about 20 ul of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 jag of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
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are specified by the manufacturer. Incubation times of about 1 hour at 37°C are ordinarily 
used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed directly on a polyacrylamide gel (or an agarose gel) to isolate the 
desired fragment, as described in Sambrook et al., Molecular Cloning: A laboratory Manual, 
Second Edition, Cold Spring Harbor, N.Y. (1989). 

[0108] "Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 
[01 09] "Ligation" refers to the process of forming phosphodiester bonds between two 
double stranded nucleic acid fragments (Maniatis, T., et al., Id., p. 146). Unless otherwise 
provided, ligation may be accomplished using known buffers and conditions with 10 units to 
T4 DNA ligase ("ligase") per 0.5 ug of approximately equimolar amounts of the DNA 
fragments to be ligated. 

[0110] Unless otherwise stated, transformation was performed as described in the 
method of Graham, F. and Van der Eb, A., Virology, 52:456-457 (1973). 

Exa mple 1 

ttar.tfirial Expression and Purification of the sma ll TFTTA subunit 

[0111] The DNA sequence encoding for the small TFHA subunit, ATCC # 75809, is 
initially amplified using PCR oligonucleotide primers corresponding to the 5' and sequences 
of the processed TFIIA subunit protein and the vector sequences 3' to the small subunit of 
TFHA gene. Additional nucleotides corresponding to the small subunit of TFHA were added 
to the 5' and 3' sequences respectively. In the case of the y subunit, the 5' oligonucleotide 
primer has the sequence 5'GCGGCGGATCCATGGCATATCAGGTATAC3' (SEQ ID 
NO:3) contains a Bam HI restriction enzyme site followed by 18 nucleotides of the small 
subunit of TFIIA (underlined) coding sequence starting from the presumed terminal amino 
acid of the processed protein codon. The 3' sequence 

v^rn^r a a ar rr A ttctgt AGT ATTGG T (SEQ ID NO:4) contains complementary 
sequences to Hindlll site and is followed by 13 nucleotides of the small subunit of TFIIA 
(underlined). The restriction enzyme sites correspond to the restriction enzyme sites on the 
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bacterial expression vector pQE-9 (Qiagen, Inc. 9259 Eton Avenue, Chatsworth, CA, 
9131 1). pQE-9 encodes antibiotic resistance (Amp r ), a bacterial origin of replication (ori), an 
IPTG-regulatable promoter operator (P/O), a ribosome binding site (RBS), a 6-His tag and 
restriction enzyme sites. pQE-9 was then digested with Bam HI and Hind HI. The amplified 
sequences were ligated into pQE-9 and were inserted in frame with 6 His residues fused to 
the amino terminus. The ligation mixture was then used to transform E. coli strain available 
from Qiagen under the trademark M15/rep 4 by the procedure described in Sambrook, J. et 
al, Molecular Cloning: A Laboratory Manual, Cold Spring Laboratory Press, (1989). 
M15/rep4 contains multiple copies of the plasmid pREP4, which expresses the lad repressor 
and also confers kanamycin resistance (Kan). Transformants are identified by their ability to 
grow on LB plates and ampicillin/kanamycin resistant colonies were selected. Plasmid DNA 
was isolated and confirmed by restriction analysis. 

[0112] Clones containing the desired constructs were grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). The 
O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The cells were 
grown to an optical density 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG ("Isopropyl-B-D- 
thiogalacto pyranoside") was then added to a final concentration of 1 mM. IPTG induces by 
inactivating the lad repressor, clearing the P/O leading to increased gene expression. Cells 
were grown an extra 3 to 4 hours. Cells were then harvested by centrifugation. The cell 
pellet was solubilized in the chaotropic agent 6 Molar Guanidine HC1. After clarification, 
solubilized TFHA subunits was purified from this solution by chromatography on a Nickel- 
Agarose column under conditions that allow for tight binding by proteins containing the 6- 
His tag (Hochuli, E. et al, J. Chromatography 411:177-184 (1984)). The small subunit of 
TFHA (95% pure) was eluted from the column in 6 molar guanidine HC1 pH 5.0 and were 
allowed to renature by themselves of in stoichiometric combination with the specified 
protein (see Fig. 2). Gel Mobility Shift assays were used to separate the transcription 
products as they appear in the Figure. 
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Example 2 

Interaction of small subunit of TFIIA with glutathione-S-transferase 
or GST-TBP fusion proteins 

[0113] Bacterial extracts of GST or GST fusion proteins were incubated with glutathione 
sepharose-4B beads (6-9 ug of GST-fusion protein/20 ul of beads) with shaking at 4°C. 
After 2 hours the beads were washed with 50 column volumes of cold buffer A (20 mM 
NaH 2 P0 4 (pH 7.0) 150 mM NaCl, 1 mM DTT, 1 mM PMSF). Washed beads (20 ul) were 
then incubated with reticulocyte lysates containing 2xl0 4 cpm of 35 S labelled protein in 300 
ul of protein binding buffer (PBB) for 1 hour at room temperature. PBB contained 20 mM 
Hepes (pH 7.9), 20% glycerol, 0.5 mM EDTA, 60 mM KC1, 5 mM MgC12, 0.1% NP40, and 
5 mM p-mercaptoethanol. The beads were subsequently washed 4 times in PBB and labeled 
proteins were eluted with 1M KC1. Samples were analyzed on 15% SDS polyacrylamide 
gels, enhanced with NaSalycilate, and visualized by autoradiography. 

Example 3 

Transcription reactions utilizing T FTTA 

[0114] Transcription reactions contained 100 ng of the Z7E4TCAT 29 or G5E1BTCAT 26 
template, approximately 200 ng of activator protein, and 40 ug of nuclear extract in a 50 ul 
final reaction volume. TFUA depleted nuclear extracts were prepared by dialyzing HeLa cell 
nuclear extract in buffer D in 20 mM Hepes (pH 7.9), 20% glycerol, 5 mM (3- 
mercaptoethanol, 1 mM PMSF, containing 500 mM KC1, followed by two sequential 
incubations with Nickel agarose beads (150 ul packed beads/1 mg of nuclear extract) for 20 
minutes at 4°C rotating. Depleted extracts were dialyzed into D buffer containing 100 mM 
KC1. The reconstituted transcription reactions and the Mg agarose EMSA were described 
previously 6 . 

[0115] Numerous modifications and variations of the present invention are possible in 
light of the above teachings and, therefore, within the scope of the appended claims, the 
invention may be practiced otherwise than as particularly described. 
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